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Dynamical decoupling can be used to preserve arbitrary quantum states despite undesired inter- 
actions with the environment, using control Hamiltonians affecting the system only. We present 
a system-independent analysis of dynamical decoupling based on leading order decoupling error 
estimates, valid for bounded-strength environments. Using as a key tool a renormalization trans- 
formation of the effective system-bath coupling Hamiltonian, we delineate the reliability domain of 
dynamical decoupling used for quantum state preservation, in a general setting for a single qubit. 
We specifically analyze and compare two deterministic dynamical decoupling schemes - periodic 
and concatenated - and distinguish between two limiting cases of fast versus slow environments. 
We prove that concatenated decoupling outperforms periodic decoupling over a wide range of pa- 
rameters. These results are obtained for both "ideal" (zero-width) and realistic (finite-width) pulses 
This work extends and generalizes our earlier work, Phys. Rev. Lett. 95, 180501 (2005). 
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I. INTRODUCTION 

Arbitrary quantum state preservation is a fundamental 
imperative for quantum information processing, but un- 
desired interactions of a candidate quantum system with 
uncontrollable external systems (the environment/bath) 
results in poor control and fidelity loss. Even if the 
structure of these interactions is approximately known, 
the statistical uncertainty in the state of the environ- 
ment invariably results in decoherence While unde- 
sired couplings to the environment are inevitable (even 
at zero temperature) , strong control fields applied to the 
system can be used to effectively manipulate the cou- 
plings. Nuclear magnetic resonance is an excellent ex- 
ample where techniques such as refocusing and compos- 
ite pulses are readily used to generate reliable and high 
precision quantum dynamics Q ■ Similar in execution 
but applicable in a generic setting, dynamical decoupling 
(DD) is a method for the effective renormalization of 
the system-bath interaction Hamiltonian via the appli- 
cation of strong system-control fields. Usually the goal 
is the cancellation of all coupling terms. In the context 
of quantum information processing, DD can be used for 
feedback-free quantum error suppression without encod- 
ing overhead 

Dynamical decoupling is most efficient against bounded 
environments, when the pulse switching times are short 
on a scale set by the bath spectral density high-frequency 
cutoff [E @, or when the spectral density is rapidly de- 
caying Within these assumptions different flavors of 
DD can be designed. Of course, technology limits how 
strongly, rapidly and accurately we can modulate the 
system Hamiltonian, and cool the system. Dynamical 
decoupling can be implemented with the pulse sequence 
chosen deterministically, e.g., periodically _8, 9], or 
randomly [l^, [HI . Randomized decoupling is expected 
to perform better in the case of varying/ fluctuating (ef- 



fective) Hamiltonians while deterministic methods per- 
form better in cases where the undesired terms in the 
system-bath Hamiltonian are sufRciently weak [l^, [l^. 
Hybrid schemes with optimized performance have also 
been considered [111, . The analysis of DD schemes is 
often performed within an interaction picture. Here we 
consider explicitly the internal dynamics of the bath in 
terms of its effect on the performance of DD. 

Dynamical decoupling strategies, some of which can be 
derived from group theoretical considerations [l^, are 
typically based on a universal DD pulse sequence [3]: 
a short sequence of unitary operators designed to com- 
pletely cancel errors up to the first order in the Magnus 
expansion p^ . Here we consider two deterministic de- 
coupling schemes: (i) In periodic DD (PDD), the univer- 
sal decoupling sequence is repeated periodically for the 
duration of the quantum state preservation, (ii) In con- 
catenated dynamical decoupling (CDD) the univer- 
sal pulse sequence is recursively embedded within itself. 
We provide an analytic leading-order study of the perfor- 
mance of the above strategies. Our first basic finding is 
a verification in the DD-setting of a result familiar from 
NMR, that even when ideal (zero-width) pulses are used 
for decoupling, the corrections from second and higher or- 
der Magnus terms impose an upper performance bound 
on PDD. 

A central result of our approach is that the coupling 
terms responsible for errors undergo an effective renor- 
malization transformation by the externally applied pulse 
sequences. This process is conveniently described via the 
Magnus expansions for derivation of effective coupling 
Hamiltonians [see Eqs. (|^T|) - (P^ below]. The renormal- 
ization approach leads to the view of DD as a dynamical 
map, whose convergence to a fixed point (ideally, the 
cancellation of the system-bath interaction) depends on 
whether the norm of the coupling terms decreases under 
repeated applications of the pulse sequence. In support 



2 



of our earlier study [17|, within the technological con- 
straints of finite pulse numbers and the bounds imposed 
by the applicability of DD in general, we analytically 
prove the asymptotic superiority of CDD over PDD. In 
addition, we present here new pulse sequences, inspired 
by the Trotter-Suzuki expansion [isj . with even better 
convergence properties than CDD. In a more abstract 
setting, we show that in fact any application of unitary 
operators on the system cannot cause an increase in the 
strength of the undesired couphngs. 

Our conclusions are valid within the convergence do- 
main of our expansions. We find that the Magnus expan- 
sion itself sets the most stringent limit on convergence 
domains, in the sense that it includes or coincides with 
the regime of applicability of DD. In the worst case, this 
corresponds to the limit of slow internal bath dynamics. 

We also analyze robustness with respect to systematic 
pulse errors. The decoupling error of any deterministic 
scheme is thus a result of the environmental coupling er- 
rors, and errors in the decouphng operations. In the case 
of realistic, imperfect pulses, we show that the perfor- 
mance of DD, when pulses of finite width (or uniform 
error rate) are used, is determined by a condition based 
on both the pulse switching times and pulse widths. In 
this case, unless pulse profiles and timings are adjusted in 
case of systematic pulse errors, even the first order terms 
(and thus dominating) in the Magnus expansion will be 
non-zero. 

While undesired coupling terms are responsible for fi- 
delity loss, the relationship between the two is compli- 
cated and depends on the details of the environment and 
possible physical energy cutoffs. This motivates us to 
perform a generic analysis based on operator-norm esti- 
mates. Our conservative estimates provide a worst-case 
analysis for decoupling performance. We expect these es- 
timates to be useful as guidelines for choosing and com- 
bining dynamical decoupling strategies when constraints 
such as pulse switching times and pulse errors are con- 
sidered. 

This paper is organized as follows: in Section |ll] we 
review the basic universal DD cycle which suffices to 
decouple a qubit from an arbitrary non-Markovian en- 
vironment to first order in the Magnus expansion. In 
Section [TIT] we provide a detailed analysis of this se- 
quence in terms of the Magnus expansion, for both ideal 
(zero-width) and non-ideal (finite-width) pulses. In Sec- 
tion [IV] we compare two deterministic decoupling strate- 
gies founded on the basic universal DD sequence: peri- 
odic and concatenated sequences. We calculate a fidelity 
measure associated with the two strategies and show that 
the concatenated one strictly outperforms the periodic 
one. In Section |V] we introduce a new decoupling strat- 
egy, based on the Trotter-Suzuki expansion. Even though 
this decoupling sequence has implementations problems 
and is not as robust as CDD, we find it interesting in light 
of its superior convergence properties. This Section also 
includes a table which compares the three deterministic 
decoupling strategies (PDD, CDD, and Trotter-Suzuki), 



and clearly illustrates and summarizes their relative per- 
formance. Finally, in Section IVTl we present a general re- 
sult concerning the behavior of error norms under pulse 
sequences: we show that for sufficiently narrow pulses, 
pulse sequences cannot increase error norms. This result 
has impact also on fault tolerance theory using quan- 
tum error correcting codes. A summary and discussion 
is presented in Section fVIII Supporting calculations can 
be found in the Appendices. 



II. UNIVERSAL DYNAMICAL DECOUPLING 
FOR A QUBIT 

In the absence of driving terms, the terms in the 
system-bath interaction Hamiltonian are responsible for 
decoherence and loss of quantum information. Removal 
of these terms is sufficient (but not necessary ^T^) for 
preservation of arbitrary quantum states. Dynamical de- 
coupling schemes use strong and fast control Hamilto- 
nians acting on the quantum system only, to effectively 
remove/modify various terms in the system-bath inter- 
action Hamiltonian [20j . In particular, a pulse sequence 
designed to remove every term in the interaction Hamil- 
tonian is referred to as universal dynamical decoupling. 
In this work, we focus on the universal DD of a single 
qubit. Extensions to multiple qubits \M a nd higher di- 
mensional quantum systems exist pll. l22j. but we will 
not consider these here. We use X, Y, and Z to denote 
the standard 2x2 Pauli matrices 

= (? J) , = ( ■ o') ' = (o -l) 

acting on a single qubit, and work in units of ft = 1. 
System and environment are assumed to inhabit differ- 
ent Hilbert spaces i.e., we do not consider leakage, which 
can also be treated using DD |21j, ^2^]), and all Hamilto- 
nian operators are taken to be traceless without loss of 
generality. 

Consider a qubit with a Hamiltonian 

H{t)^H,t,i{t)+H,{t), (1) 

where i?ctri refers to a time-dependent controllable 
system-only part and includes all other terms, i.e., the 
internal bath, internal system, and interaction Hamilto- 
nians: 

H^ = Hb(E)Is+Ib(»Hs+ Hsb- (2) 

Here / denotes the identity operator. We have implicitly 
excluded a pure-bath Hamiltonian term Hg Is satis- 
fying [Hg Is,F] = 0, where F is any element of the 
Lie algebra generated by Hsb, Hb ® Is, and Ib Hs- 
The reason is that such a term on the one hand does not 
impact the system dynamics, but on the other hand will 
increase the operator norm that arises below in our cal- 
culations of decoupling errors. With this in mind, ideally 
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one would like to have = 0- The "error Hamiltonian" 
can always be expanded as: 

He ^ Bo Is + Bx (E) X + By (^Y + Bz ® Z (3) 

where Ba (a = 0, X, Y, Z) are operators acting on the en- 
vironment. We are allowing the Ba to include the iden- 
tity operator, i.e., from now on we are incorporating Hg 
into HsB- I-e., assuming 

Hs = ^ WqCTo, (4) 

Q—x,y,z 

with bJa all non-zero frequencies, and writing 

HsB^ X! ^a®(ya, (5) 

yields 

Ba = Uals+ba ae{x,J/,z}, 

Bo - Ib+Hb. (6) 

Note that the first term in H^, Bq ^ Is, is a pure- 
environment term and simply generates the environ- 
ment's internal dynamics. It also includes the global 
phase generating term Ib (E) Is- Obviously if bx — by = 
bz = 0, then — Bq ® Is has no effect on the system 
dynamics. 

Universal DD of a qubit with ideal pulses removes ev- 
ery term in except Bo>S) Is, by applying the following 
pulse sequence: ±X±ZfXfZ, where f denotes a "pulse- 
free" period of fixed duration. The pulses are generated 
by i?ctri- This universal DD sequence is a simplification 
of {IsfIs)iXfX){YfY){ZfZ), where X,Y,Z and the 
identity Is represent the decoupling group Q on the qubit. 
The universal decoupling group has the property that 
for every Hamiltonian H acting on the system, the sum 
J2{Pieg} BiHPi acts trivially on the system [3]. Since 
this sum is the leading order generator in the Magnus ex- 
pansion, the universal DD sequence completely removes 
any system-bath interaction to first order (we revisit this 
in detail in subsection IIII B[) . 

A complete analysis of the performance of DD needs to 
take into account details of the environment (participat- 
ing modes, energy cutoffs, temperature, (dis)equilibrium, 
etc.) but we shall minimize these considerations and fo- 
cus on the model-independent features of DD. Our anal- 
ysis is mathematically constrained by convergence do- 
mains that are explicit in our approximations. We expect 
certain unbounded systems such as bosonic environments 
to be within the realm of our theoretical framework after 
the introduction of spectral cutoffs. 

III. ANALYSIS OF THE UNIVERSAL 
DECOUPLING PULSE SEQUENCE: THE 
RENORMALIZATION TRANSFORMATION 

In this section we derive the transformation of various 
Hamiltonian terms under the basic universal DD pulse 



sequence. We first consider ideal pulses, then amend our 
discussion to allow for non-ideal (finite width) pulses. We 
will see that the system-bath interaction Hamiltonian is 
effectively renormalized under the DD pulse sequence. 
As long as this renormalization transformation is norm- 
reducing, the DD procedure is effective. 



A. Ideal Pulses 

Let us first assume that the pulses used are ideal, i.e., 
infinitely strong and narrow. For example, i?ctri(^) = 
^6{t — to)X generates an ideal X pulse at time to {S is 
the Dirac (5-function) . The propagator corresponding to 
free evolution f of period tq is given by: 

Ui=cxp{-iToH^"y), i/("):=i/e. (7) 

To obtain the cycle propagator we use the identity 
Me^M^^ — e*^'^*^ , valid for any operator A and in- 
vertible M. Define 

Hi = Bo (E I + Bx (E X + By (EY + Bz E) Z = IHJ, 
H2 = Bo®I + Bx®X-By®Y-Bz®Z = XHeX, 
H:i = Bo®I-Bx®X + BY®Y-Bz®Z^ YH^Y, 
H4, = Bo®I-Bx®X-By®Y + Bz®Z^ ZHeZ, 

(8) 

The free evolution propagator can then also be written 
as f— exp(— iTo-ffi). Using Eqs. ^ we can write the 
total propagator corresponding to the universal DD cy- 
cle, {IfI){XfX){YfY){ZfZ), in terms of four effective 
Hamiltonians describing the different evolution segments, 
as: 

Ui = e~"'>"^ e~iroH2 ^-iroHs ^-iroH, ^ ^g-^ 

A time- varying piecewise constant Hamiltonian, H{t) 
varying over four intervals each of length tq, can gen- 
erate this propagator. The total propagator can then be 
used to define the effective Hamiltonian H'^^^: 

4 

Ui = nexp(-iroi/f ^) =: exp(-z4Toi/(i)), (10) 

2=1 

where we have added superscripts (0) to the Hamiltoni- 
ans Hi of Eq. ([5]), in anticipation of the concatenation 
procedure that we consider in subsection IIV Cl 



B. Magnus Expansion 

The cycle propagator Ui [Eq. PH)) ] can be approx- 
imated using a Magnus expansion (for an alternative 
method of analysis that is particularly useful for the de- 
sign of periodic sequences of soft pulses see Ref. f23|). 
Consider a time-dependent Hamiltonian H{t) generating 
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the propagator U{t) from time to t. In the Magnus 
expansion (a type of cumulant expansion) we have 

oo 

Uit)^expJ2Mt). (11) 

with Ai and A2 given by: 

Ai^^i f dhH{h), (12) 
Jo 

A2--i/ dh [ \t2[H{h),H{t2)]. (13) 

^ Jo Jo 

Higher order terms are given by higher order commutator 
expressions p^ . A recent bound for the convergence ra- 
dius of the Magnus expansion [23| translates in our case 
into max ||77(t) ||t < 2.^ In many situations ||i?o|| (norm 
of the environment's internal Hamiltonian) is expected 
to dominate the Hamiltonian and we may as well use 
11^0 II ^ ^ 1 as a conservative convergence domain. This 
bound can, however, be superficial since not all degrees 
of freedom of the environment might actually be involved 
in the dynamics. For example, in a spin bath, bath spins 
far away from the system will not immediately contribute 
to the dynamics but this will nonetheless increase ||i?o|| 
without changing the real convergence radius. A pre- 
cise analysis of the actual convergence radius requires 
us to estimate the next-to-leading-order terms - see Ap- 
pendix 13 

For the piecewise constant evolution of the Hamilto- 
nian in Eq. we can calculate Ai and A2 in terms 
of H{t) = {H, for (j - l)ro < t < jto}^^^, i.e., the 
sign- flipped Hamiltonians appearing in Eq. ([9]): 

= -^roiHf^ + ui"'^ + H^^^ + ijf^) (14) 

A''--lr^o E [^f^^f]- (15) 

l=i<j=4 

Again, the superscripts are included in anticipation of the 
CDD analysis below. Note that the pure-environment 
parts of A^^'\ i.e., terms of the form B®I, have no effect 
on the dynamics of the qubit to first order in tq, but do 
have an effect to second order in tq, through the com- 
mutator terms. Clearly, pure-environment terms are not 
renormalized under the DD procedure. 
Using Eqs. (IH1),(|I11),(IIS1) we find: 

a'^P = -i{4To)Bo^I 

+2T^{[Bo,B,]-i{Bx,Bz})®Y (16) 



Throughout this work we use ||j4|| to denote a unitary invariant 
operator norm, e.g., the maximum eigenvalue for traceless oper- 
ators, or the absolute difference between the smallest and largest 
eigenvalues [25II . 



This shows that while to first order in the Magnus expan- 
sion (the ^^^^ term) the universal decoupling cycle com- 
pletely removes the coupling to the environment, there is 
a leading second order correction due to ^2^' in which 
the coupling to the environment has not been removed. 

Note that a pure-environment term appears only in 
A']^"^ and, due to our particular choice of DD sequence, 
iXiZiXiZ, there is no term involving (g)Z in A'^K 

We now define two norms which will play a central role 
in our analysis: 

/3:= llBoll <oo (17) 
J := max(||Bx|| , ||By|| , \\Bz\\) < ^. (18) 

Recall that Ba = i^als + ba for a G {x,y,z}, and 
Bq = Ig+Hg; unless otherwise specified, we assume that 
J < /3 in order to simplify our convergence arguments. 
This conservative assumption is reasonable for systems 
where only a small number of environment particles (or 
degrees of freedom) are coupled to a given qubit (such as 
electron spins coupled to a nuclear spin bath [2^) - which 
translates into a small ||6a|| - whereas no restriction ex- 
ists on the environment self-Hamiltonian {e.g., on the 
number of particles). The distinction between J and /3 is 
a reflection of the different roles Hsb and Hb play in the 
dynamics of the system. In simple terms, J quantifies the 
direct coupling strength while (3 quantifies typical bath 
frequencies. Consider, e.g., the simple case of a spin qubit 
coupled to another spin- 1/2 particle via a Heisenberg 
couphng: He=LoZB®I + c{Xb ® X + Yb®Y + Zb® Z), 
where c is the coupling coefficient. In this case we have: 
J = 0(c) and (3 = 0{lo). 

The Magnus terms can be bounded using these quan- 
tities: pf^ll = 0(to/3) and ||A^^^|| = 0{t§(3J). Higher 
order Magnus terms, ^■>2i "^ill contain all orders of 
rgJ*^/?'"*^ where 1 < k < i — 1, and the leading term 
is always given by 0{tqJP^~^) since J < (i. As long as 
To/3 <C 1 we can safely neglect A^^^^'- 

Ii4>^2ii«ii4'^ii<ii4'^ii- (19) 

Note that our derivations are based on the separation of 
the coupling terms Bq®! and Bx®X + By®Y+Bz®Z 
in the error Hamiltonian H^. A similar separation can 
be done for decoupling schemes on systems other than a 
single qubit. 

The approximate effective Hamiltonian i?'^^ cor- 
responding to the basic dynamical decoupling cycle, 
Eq. Uni), is now: 

QL 

The renormalized environment operators b'^^ , which can 
easily be read off from Eqs. are the main result 

of the DD procedure. The important message emerging 
from the analysis in this subsection is that even when 
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ideal (infinitely strong and narrow) pulses are used, the 
universal decoupling cycle only removes (the system-bath 
component of) the lowest order Magnus term, and renor- 
malizes the higher order Magnus terms. The success of 
dynamical decoupling ultimately depends on whether the 
mapping to the renormalized environment operators is 
significantly norm-decreasing, an issue we address in de- 
tail below. 



C. Pulses of Finite Width 

Ideal pulses that act as system-only unitary operators 
are simplified mathematical abstractions. In this subsec- 
tion we model and analyze the effect of the finite width 
of pulses in decoupling. For simplicity we consider rect- 
angular pulses P with a width S. The ideal finite-width 
pulse is. 



P = exp{~i5Hp), 



(21) 



where Hp is a fixed control Hamiltonian. For realistic 
pulses we must include He in the pulse propagator: 



Up = e^p{-iS{Hp + He)). 



(22) 



We call such a pulse "non-ideal" . Extremely narrow 
pulses with ||-ffp|| ^ ||-ffe|| are thus desirable to mini- 
mize the effect of the unwanted terms in the pulse Hamil- 
tonian. Here we build upon the approximation of instan- 
taneous pulses in subsection IIIIBl by decomposing the 
non-ideal pulses into products of the ideal unitary opera- 
tor of the pulse P and some effective pulse error unitary 
operator Ep. We explicitly approximate the operators 
Ep for rectangular pulses on a qubit, but the decompo- 
sition of the actual pulse into an ideal pulse and a "pulse 
error" unitary can be reproduced for other pulse shapes 
as well. 

The periods of the universal dynamical decoupling cy- 
cle need to be adjusted in order to incorporate the time 
delays associated with finite pulse widths. Therefore, as- 
sume that all free evolution periods, with propagator Uf, 
are adjusted to length tq — 6. The propagator for the 
cycle can then be written as: 

f/''' = UtUxUiUzUiUxUiUz 

= UiExXUiXE'zYUfYE'xZUiZEz 

= ■■ Ui,ExUi,E'zUf,E'xUuEz (23) 



where 



= f.-i&iv'y.+H,) a^X.Z, (24) 



St] — 7r/2, and, in order to fit the formulation of subsec- 
tion nil Bl we have defined the pulse-error operators as 
follows: 



ExX := Ux, 
ZEz := Uz, 



YE'xZ := Ux, (25) 
XE'zY := Uz. (26) 



Note that since the errors Ea are unitary and are pro- 
duced during an interval S, we may formally associate 
them with effective Hamiltonians defined through 



E^ 



exTp{—i6HE,, 



(27) 



Using these definitions, Eq. is equiva- 

lent to the evolution due to a piecewise con- 
stant Hamiltonian H{t), given by the sequence 
{Hi, He,x, H2, H'^ Zj H'^^, H4, He,z} with Hi 
given in Eqs. (O, at appropriate times. We ignore terms 
of order and ^tqUBqIP, which allows us to 

treat the components of He^oi as c-numbers instead of 
operators, since no commutators will be involved. Using 
Eqs. P^ . (fT5|) . we can repeat the calculation of A^^-* and 

A2^\ this time including the pulse segments with He. a 
as their effective Hamiltonians, and consider the limit 
of narrow pulses, S <C tq. In this limit, we can safely 
truncate the Magnus expansion after A^2 \ provided we 
assume: 



CTqP 



d— <Cl, 

To 



(28) 



where c and d are numerical factors of 0(1) (recall 
that (3 := ||-Bo||)- This inequality is derived in Ap- 
pendix |B] Note that it implies an optimal pulse interval 
To = y^d6/cf3, which minimizes the left-hand side of ((28|) 
for given /3 and a fixed minimal pulse width S. Note fur- 
ther that Ineq. ((28)) is not as strict as the condition for 
the convergence of the Magnus expansion, which reads 
(when /? ^ J): /3T < 1, where T is the total experiment 
duration. 

The components of the effective Hamiltonian can be 
calculated explicitly: 



(1) 

Q 

(1) _ 



0, 



B'^>^^{To-S)[Bo,BP] + -{lB^ 

Tq 2 



(0) 



-B 



(0)n 



(29) 
(30) 



4) = lr„([Bo,4"'] 



+ -5([i3o,i?lyV2^{4°^4'] 



B 



(1) 



2 "(o)/„(o) 



2^4)4°)) 



(31) 



iS 



-4^(4 



4°)) + 4^5^°^ 



TT 



Y 



-B 



(0) 



Tq 



(32) 



The only modifications associated with the pulse width 
S are of order 0{J{SJ + S/tq)), associated with the new 
small parameters SJ and S/tq. In this case the decou- 
pling is not exact and even the first order Magnus terms 
contribute decoupling errors of order SJ, the "per-pulse- 
error". This is an important effect which will adversely 
affect decoupling schemes not designed to compensate for 
such finite pulse-width errors. 
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We note that it is possible to design a piece-wise con- 
stant profile for the control Hamiltonian -ffctri such that 
the first order Magnus corrections due to systematic er- 
rors in the control Hamiltonian are zero [23] ■ The sepa- 
ration of non-ideal pulses into ideal and error pulses, as 
above, still applies to this "Eulerian decoupling" scheme, 
as do most bounds we obtain here. Finally, we note that 
treatments of decoupling and refocusing errors similar to 
the above have been pursued in an NMR-specific setting 

M. 



IV. DECOUPLING STRATEGIES 

The universal DD cycle results in segments of evolu- 
tion with Hamiltonians Hi such that ^ acts triv- 
ially on the system. The derivations of the previous 
section show how the actual overall propagator contains 
higher order corrections that do not act trivially on the 
qubit. Nonetheless, the basic universal DD cycle pro- 
vides us with the building blocks of general decoupling 
schemes that optimize decoupling performance with re- 
spect to constraints in switching times and pulse preci- 
sion. Numerical simulations comparing and discussing 
some of the schemes (deterministic, randomized, or hy- 
brid) are available [H, [13, [2§| (ideal pulses) and 
(ideal and non- ideal pulses), and in this section we focus 
on system-independent analytic arguments. We deviate 
from our abstract treatment of bath operators by con- 
sidering two limiting cases for the coupling strengths of 
system-bath and pure-bath, namely: (i) J < /? and (ii) 
/3 <C J [recall Eq. (HH])]. In case (i), the coupling to the 
environment induces slow dynamics while the environ- 
ment itself has fast dynamics. In case (ii), the coupling 
to the environment is dominant but relatively stable due 
to the environment's slow internal dynamics. This regu- 
larity makes case (ii) more attractive for dynamical de- 
coupling, or similar methods js^ , while case (i) is a worst 
case scenario. As will be noted however the presence of 
higher-order commutators blurs out the distinction be- 
tween the two cases in higher orders of concatenated de- 
couphng (Subsection IIV Cp . Nonetheless, both cases are 
still within the convergence domain of Magnus expansion 
(||-ffe||r < 1, see subsection IIIIBI) . Outside the conver- 
gence domain (e.g. corresponding to a longer duration 
of the experiment), deterministic decoupling might be 
replaced by randomized decoupling methods, for which 
there is some evidence of better performance [l2|. In 
practice the co-existence of various bath regimes makes 
hybrid decoupling methods a practical choice for long- 
time decoupling [29|. 



A. Error Phase 

We require a measure of fidelity to quantify the per- 
formance of DD. To this end we define the error phase 
corresponding to a propagator U = e~'^^= , describing 



an evolution of total duration T generated by an effec- 
tive Hamiltonian Hp, as 



where 



\H,-Bo^I\\ 



(33) 



(34) 



is the norm of the non- pure-environment part of H,,, 
which is effectively (like J) a measure of the coupling 
strength. The pure-environment part is explicitly ex- 
cluded from the error phase since it does not affect the 
fidelity / (state overlap between ideal and decoupled evo- 
lution) up to the leading order in our expansion. In this 
manner the error phase now simply connects the coupling 
terms in the Hamiltonian to the infidelity and decoher- 
ence. Indeed, for small error phases the infidelity, 1 — /, 
depends monotonically on $ - see Eq. ([50)) below. 

In any physical implementation of dynamical decou- 
pling we are limited by technological constraints. Let 
N denote the number of pulses used during a decou- 
pling experiment of duration T. Normally this number is 
bounded above due to a minimum pulse switching time 
TVnin: N < T/Tmin- A basic pulse width 5 is used when 
required, to characterize the systematic error due to non- 
zero pulse widths. These technological constraints are 
incorporated below by evaluating the fidelity gain due to 
decoupling in terms of N, T, and tq. 



B. Periodic Decoupling 

In periodic DD for a qubit, a basic universal sequence, 
such as fXiZ iXiZ, is repeated periodically over the 
whole interval T. If N pulses are used, there are then 
N intervals of length tq = T/N that correspond to the 
free evolution periods, and iV/4 repetitions of the basic 
sequence. The effective Hamiltonian for the total inter- 
val, is obtained from the results of the previous section, 
as long as we are within the convergence limit of the 
Magnus expansion, given by r||i/e|| < 1- Limiting the 
Magnus expansion to the first two terms, Ai and A2, the 
propagator for the total evolution is given by 



U « exp[A[^^ + 4'^] = exp(-iri/(i) 



exp 



-iT \ Bo(E)I 



FlpBaBp 



(35) 



where the effective coupling coefficients F^p are given 
in Eqs. and can be calculated for any decoupling 
scheme (not just the universal sequence). 

For brevity we introduce the parameter G := 
max(J, /?). We read off the error phase from Eq. ([55[) 
as 



(1) 

PDD(i) 



= T/i = 0{TJtoG). 



(36) 
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The above estimate for ideal pulses will be modified to 
the following if rectangular pulses of width S are used: 

*PDD(i)W-0[TJ(roG + ,5/ro)]. (37) 

Our estimates show that in if ideal pulses are used, using 
a higher number of pulses, at fixed T, leads to monotonic 
improvement in the error phase. I.e., the error phase is 
proportional to the pulse interval tq. Technology sets a 
lower limit r^^in on tq, which implies that the infidelity is 
bounded from below by a monotonic function of TminTJG 
in the ideal pulse limit, and the fidelity gain scales with 
the number of pulses N. From Eqs. ([571) fo^' non-ideal 
pulses, we expect the optimal pulse interval to be given 
approximately by tq = max{Tinin, ((^/G)^/^}. In these 
expressions we have assumed that the pulse width S is 
already at the technological lower limit. 

C. Concatenated Decoupling 

1. Definition 

Significant improvement over periodic DD can be ob- 
tained by constructing a concatenated sequence, i.e., 
by recursively embedding the basic universal DD cycle 
within itself [17| . This is done in the following manner: 

Po = f 

Pi = Po-'^Po^Po-'^Po^ 

Pn — Pn-lXpn-lZpn-lX-pn-lZ. (38) 

Here po (no pulses) is of duration tq, pi is of duration 
Ti = Atq (in the limit of ideal, zero- width pulses), and p„ 
is of duration r„ = 4t„_i = 4"ro {n levels of concatena- 
tion). 

As we are about to show, PDD dramatically outper- 
forms CDD over a wide parameter range. At an intuitive 
level, this is attributable to the fact that CDD, with its 
self-similar structure, has error correcting capabilities at 
multiple resolution levels, whereas PDD allows errors to 
accumulate essentially as a random walk. 

First, however, let us remark that the aperiodic se- 
quence pn, may be simplified using Pauli matrix prod- 
uct identities when Pauli operators appear in succession 
(XY = Z and cyclic permutations), in the same man- 
ner that the universal decoupling sequence is simplified 
from {IsfIs){XfX){YfY){ZfZ) to pi. The reduction 
in the number of pulses gained by such algebraic cancel- 
lations might not be strictly advantageous in a practical 
setting, since experimentally it is not always possible to 
generate rotations around all three axes. Moreover, the 
simplification does not change the asymptotic behavior 
of the number of pulses as a function of the concatenation 
level n. The simplification does affect the formal recur- 
sive structure of the sequence. This has no physical ef- 
fect when decoupling pulses are ideal. But with realistic. 



finite width pulses, this loss of self-similarity might ad- 
versely affect the robustness of the pulse sequence against 
systematic errors. On the other hand one could argue 
that the product of two slightly wrong pulses is worse 
than one, which would be an argument in favor of sim- 
plification. A clear decision one way or the other must 
be made in a context-specific setting. 



2. Effective Hamiltonian 

Due to its recursive definition, the propagator corre- 
sponding to Pn-i is generated by an effective Hamilto- 
nian H^^ , which is then decoupled with p„ and in turn 

(n) 

generates He ■ The interval length is multiplied by 4 in 
each such recursive step in which the effective Hamilto- 
nian is renormalized. The truncation of the Magnus ex- 
pansion beyond the first two terms must be justified, so 
that the higher order terms do not accumulate as the con- 
catenation level goes up - we do this in Appendix [XI We 
can then construct the higher order effective Hamiltoni- 
ans by truncating the Magnus expansion and recursively 
obtaining from TJe"^^-*. In the following, ijf"'' are 
constructed as in Eqs. ([5]) and are reproduced each time 
from the Magnus expansion: 

i/f) = ^(4") + 4")) 

-^r„ [H\''-'\H^r"\ (39) 

l=i<j=4 

The sum of the operators H^"" is independent of tq and 
contributes to the pure-environment part. Nonetheless, 

(n) 

He contains the commutator terms that do include 4"ro 
(and contribute to the error-terms acting on the system). 
The commutator [h["' must compensate for 

this exponential growth. 

For the qubit case we can derive the explicit form of the 
sequence of effective Hamiltonians ffi"'', by finding the 

in) 

Ba ■ We already have the first step of the concatenation 
in Eq. (|16p . This also serves to initialize the recursion. 
Proceeding recursively we obtain, for the next iteration: 

r(1) ^ r(2) _ o 

r(1) ^ R(2)_,vaR(l) r(1)1-,V[R r(1)i 

B« - S(^)=zni([5«,4^)]-^{5«,BW}) 
B'i^ ^ Bf = 0. (40) 
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Therefore, for general n: 

B^"^ = So, for 71 > (41) 

B^,"^ = (ir„_i)[So, b'^;1-% for n > 1 (42) 



4"' = i(zT„_i)[i?o,i3^"~'^], forn>2 (43) 

for n>l (44) 



B^") = 



These equations capture the essence of the renormaliza- 
tion transformation the error terms experience under the 
pulse sequence. 



3. Convergence and Performance 

Next we study the convergence conditions of the latter 
recursive relations. Let us define 



:=max{||4")||,||i?(r)|l}. 



(45) 



Since i/e"^ ~ J2a ^""^ ® ^a, /i'"-* is closely related to a 
bound on -ffi"'' . 

We consider the analog of case (i) in PDD, i.e., let 
J < (3. Then, after recursively applying Eq. (|^^ and 
the inequality < 2\\A\\ \\B\\ (valid for bounded 

operators A and B 25]), we have 



B 



(") 

X 



n-l 



Tl-l 



4") 



i=0 

2"'(/3ro)"J. 



= j[](2x4Vo/3) 



(46) 



Because of the factor of 1/2 in By"^ [compare 
Eqs. we also have: 



B 



in) 



< 



B 



(n) 
X 



for sufficiently large n. (47) 

Therefore, at the final concatenation level n = Uf 

< 2"/(/3ro)"^ J. (48) 

The total duration T is given in terms of tq and nj as: 

T = r„^ =4"/ro =7Vto. (49) 

Let us now give the connection between the fidelity 
and the error phase. The propagator corresponding to 



rr("/) 



and 



the whole sequence is given by exp 

the ideal evolution of the system is given by the identity 
operator. If fidelity is measured as the "state overlap 
between the ideal and the decoupled evolution" , we can 
write l3l|: 



= 1- 



CDDj 



(50) 



where H_ refers to the system-traceless part of H. 



Generally, a different universal DD pulse sequence as 
the basic cycle of concatenation will modify Eqs. ([32])- 
(|43ll but the (asymptotic) form of Eq. (|48p remains the 
same. The overall error phase $cdd = T/i'^"-'^ can be 
bounded from above using Eqs. (|48|) . (|49)) : 



ODD 



< (/3T/7Vi/2)i°g4A^(jr). 



(51) 



We expect the above bound to be satisfied provided the 
convergence condition PT/N^/"^ < 1, i.e., /3 < {Ttq)-^^^, 
is satisfied. However, this condition is less strict than the 
condition appearing in Appendix 1X1 for the truncation of 
the Magnus expansion: (3 < 1/T [Eq. (jAlOp j. 

For comparison, we also estimate $pdd = Th^^\ i.e., 
simply take nj = 1 in Eq. (|48| : 



$PDD = 2(/3ro)(JT) = 2{(3T/N){JT). (52) 



Indeed, this agrees with Eq. ([36|) . Note that for = 4, 
and taking the equality signs in Eq. (j5ip . we have as 
expected <I>cdd = $pdd- Now, we may conclude, by 
comparing Eqs. ([51]) and ([52]), that when /? < (Tto)"^/^, 
*1?CDD converges quickly to zero as the number of pulses 
increases, while no such convergence is observed for PDD 
over the same total sequence duration. In particular, 
while the fidelity gain in PDD scales with the num- 
ber of pulses TV, it scales with (A^i/Vc)'°^*^(» A^ for 
Af ^ 1) for CDD, where c = /3T is a small constant 
regulated by the actual convergence domain. On the 
other hand, this convergence domain puts a physical up- 
per limit on the number of concatenation levels, imposed 
by /3 < (Tto)-^/2 or the stricter /3 < 1/T (c < 1). 

Another way to compare CDD and PDD is as follows. 
By fixing the value of c and /3, we can back out an up- 
per concatenation level: n™'^'^ = — log^{f3To / c) . Inserting 
this into Eq. ((5T|) we have: 



CDD 



< (c/3ro) 



{JT). 



We can now compare the CDD and PDD bounds: 

3^0 



*CDD ^ (c/3ro) 



PDD 



2(/3ro) 



0, 



(53) 



(54) 



which serves to show that CDD is indeed superior to PDD 
in the (relevant) limit of small (3tq. This key result was 
first reported in our previous study [17], without a full 
proof. 

When the dynamics is dominated by direct system- 
environment coupling, namely (3 <^ J [case (ii)], we find 
that the third order Magnus term dominates the effective 
pure bath term. We find that the effective coupling is 
then bounded by: 



< c"/(max(/3',/3)ro)"^J^ 
where c = 0(1) and 



for n/ > 2 (55) 



(56) 
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is an effective pure-bath term that arises in the 3rd or- 
der Magnus expansion and kicks in at the second level of 
concatenation. The asymptotic behavior in the effective- 
ness of dynamical decoupling is thus the same as case (i) 
[compare to Eq. (|48)) ]. however, due to the dependence on 
a higher power of J, we see that this is a more favorable 
scenario for CDD. 

Any universal decoupling sequence (e.g., higher order 
sequences) can be concatenated and our analysis still ap- 
plies. When this is done, Eq. ([5T|l becomes 



*CDD < $o(a^"°) 



(57) 



where $0 is the error phase for a free evolution of the 
system for time T. The parameter a is generally bounded 
by a power of ||iJe||T^ and is required to be small [it is 
analogous to (3T in Eq. I|5ip]. The parameters a and b 
are 0(1). The parameters a, a, b all depend on the basic 
decoupling cycle used and are all positive. 



for decoupling. In fact randomized decoupling techniques 
have been shown to be efficient in the limit of fast time- 
varying/fluctuating system-bath Hamiltonians for which 
the deterministic methods (PDD/CDD) are relatively in- 
effective 1131. 



E. Example: Decoupling in Spin Quantum Dots 

In this subsection we apply our analysis to the case of 
an electron spin in a quantum dot coupled via the hy- 
perfine interaction to a bath of nuclei. The Hamiltonian 
for the interaction of an electron with spin S (the qubit), 
confined in a semiconductor quantum dot, with a collec- 
tion of nuclear spins /„, is given by: 

H = Hs + HsB + Hb 

= nZ + Y, Anin ®Z+Y. Bnm[lnlm] (59) 



D. Finite Width 

We can analyze the finite-width pulse CDD procedure 
by using the pulse error unitary operators introduced in 
subsection IIII CI The systematic error associated with 
these pulses at each level leads naturally to a decoupling 
error, and is corrected at the next level of concatena- 
tion. For convergence, we require the coupling strength 
to shrink as a function of concatenation level. We ob- 
tain the following condition for convergence of the con- 
catenation procedure for rectangular pulses, derived in 
Appendix [B] 



c'ro/3 + d'— < 1. 

To 



(58) 



where c' and d' are numeric factors of 0{1). This in- 
equality is a special case of Ineq. (^5)) , so the latter is al- 
ready sufficient to guarantee convergence. As mentioned 
there, the finite-width convergence condition implies an 
optimal pulse interval tq — y^d'S/c'f3, for given f3 and a 
fixed minimal pulse width 5. We reiterate that we have 
required 5 ^ tq, but we expect this requirement to be 
inessential as evident from the exact numerical simula- 
tions reported in [l7l | . For example when a technological 
lower limit on the smallest switching times was used, we 
obtained a "closed-pack sequence" with S w T,„iii that 
still provides significant decoupling. 

Generally, errors associated with non-zero pulse widths 
are a combination of systematic and random pulse errors. 
By construction, the recursive nature of CDD tolerates a 
significant level of systematic pulse errors, since the de- 
coupling error at each level is "cleaned up" at the next 
level of decoupling. Random pulse errors are tolerated 
to some extent as well |17| : this is reminiscent of ran- 
domized decoupling techniques where pulses implement- 
ing random unitary operators on the system are utilized 



where An and Bnm are coupling constants and 
Bnm[lnlm] is a shorthand for the diagonal and off- 
diagonal dipolar coupling terms among the nuclei. The 
intra-nuclear dipolar coupling is short-ranged {Bnm oc 
^mri)- The hypcrfinc interaction between the electron 
spin and the nuclei is a Fermi contact interaction, whose 
magnitude is given by A„, which is proportional to the 
electronic wave function magnitude at the position of 
nucleus n. The parameters relevant to our bounds are 
given by J = 0(/E„^") and /? = 0(/2 i3„,„), 
where I{~ 0(1)) denotes the nuclear spin. The number 
of nuclei within the radius of the electron wavefunction 
is of the order of 10^ and for GaAs we have j3 ^ lOkHz, 
and J ^ IMHz [s^ . |33| . Let us define an error rate 
(f>/T for an evolution of length T with an er- 
The uncorrected (pulse-free) error rate 
0{J) ^ IMHz. Let us take our inter- 
pulse interval tq — T/N, to be smaller than l^s. At any 
pulse rate faster than this, FDD provides improvement: 
epDD = j'^To < IMHz [Eq. dSe])]. Note that this error 
rate is fixed and applying more pulses at this rate only 
retains it, resulting in an absolute error that grows lin- 
early with time. For a detailed example of this improve- 
ment see Ref. ISJ] where a Carr-Purcell-Meiboom-Gill 
(CPMG) [3^, l36l| sequence is used to decouple the elec- 
tron spin from the nuclei (we give a concatenated version 
of the CPMG sequence in Appendix [C]).^ In a system- 
specific example such as that of Ref. [34], many of our 
conservative bounds on the commutators appearing in 
the effective Hamiltonian can be improved. 

The error rate for CDD, ecDD is given by dividing 
Eq. ([5T|) by T (or more generally by Eq. ([57)) ). which 
decreases super-polynomially with the number of pulses 



as e 
ror phase $. 
eo = $/T is 



^ The CPMG sequence is a simple dynamical decoupling sequence 
consisting of fixed periodic spin flips corresponding to X pulses 
in our notation. See Appendix [C] for a detailed discussion. 
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FIG. 1: Simulation of PDD (square) and CDD (circles) for 
concatenation levels 1 to 8. Results shown are for one minus 
purity of the system qubit as a function of the number of 
pulses. The pulses used are ideal. Note the fantastically high 
purity achieved with CDD. In the case of PDD the horizontal 
axis gives log^ of the total number of pulses, while for CDD 
the numbers on the horizontal axis denote the concatenation 
level. 



used. Note that, while initially we start in a regime 
where (3 <^ J for low levels of concatenation, the effective 
(renormalized) value of J quickly decreases with increas- 
ing concatenation level, and we are in a limit where (3 
(or the effective [Eq. ([56]) ] ) dominates through the un- 
decoupled error term. Finally, we note that as long as 
pulse widths that are negligible with respect to our pulse 
intervals are used, the condition for convergence of con- 
catenation [Eq. (|58p] is satisfied in our example. 

We performed a numerically exact simulation for com- 
paring different pulse sequences for a qubit coupled to a 
small environment. We consider a spin chain of length 
N , where the central spin is from a different species (the 
electron in the dot versus the surrounding nuclei) and 
acts as the system qubit. The (undesired) couplings be- 
tween the spins is given by Heisenberg interaction terms, 
so that the coupling Hamiltonians between two spins i 
and J at a distance d (measured in units of the lattice 
constant) is given by ^{XiXj+YiYj + ZiZj), where c de- 
pends on the species of the spins i and j. We fix the cou- 
pling strengths and the number of spins so that the cou- 
pling strengths roughly correspond to GaAs our example: 
J = IMHz and (3 = lOKHz. The effective errors at the 
end of the concatenated cycles are so low (purity loss of 
around 10~^°° - see Fig. [1]) that we are forced to use 
extremely high precision linear algebra that significantly 
burdens simulation performance. To get practical results, 
we argue based on Lieb- Robinson bounds (see [s^l) that 
within the duration of the simulation, enlarging the spin 
chain will not modify the results. This was verified inde- 
pendently for a test case where there was no significant 



FIG. 2: Simulation of PDD (squares) and CDD (circles) for 
concatenation levels 1 to 8 with non-ideal pulses. The pulse 
widths are set at Ips, lOps, and lOOps. Axes as in Fig. [1] 



qualitative difference between N = 3,5,7. Therefore the 
results presented here are for = 3 and they capture 
the essence of our estimates. We fix the overall dura- 
tion of the pulse sequence at 10~^s so that (3T < 1 [see 
Eq. ((28|) ]. This allows us to concentrate on the inter- 
pulse period and pulse-widths that are the main tech- 
nological challenge for decoupling. Higher concatena- 
tion levels thus correspond to log(number of pulses) [or 
to log (1 /inter-pulse period)]. In Fig.[l] we compare CDD 
and PDD purities for ideal pulses at various levels of con- 
catenation. The vertical axis displays the loss of purity of 
the system qubit, i.e., 1 — Tr[(p5)^], where ps = TrB(p), 
and p is the joint system-bath density matrix at the final 
time T. The higher order data for PDD are obtained by 
simply repeating the basic sequence and shrinking the 
pulse-interval. This is necessary to get the long-term be- 
havior of decoupling. The graphs show a progressive im- 
provement in purity of the initial qubit state, ^"^^"'"^ , as 
a function of concatenation level. The environment part 
of the spin chain is initialized in a thermal state at a tem- 
perature of IK. In Fig. [2] we have depicted the effect of 
realistic pulse widths on decouphng. We note that in the 
case of PDD the performance of the pulse sequence de- 
teriorates with increasing pulse width (as expected) , but 
also (after an initial improvement, as in the ideal pulse 
case) as a function of the number of pulses used. This 
latter deterioration is somewhat surprising, and is due 
to the pulse width errors that simply accumulate over 
time. The point of deterioration shifts to the right as 
the pulse width is made smaller, as expected. The im- 
provement seen for the lOOps case at log4(A^) = 7 can be 
understood as being due to the essentially random-walk- 
on-a-circle-like nature of the accumulated errors, which 
will occassionally result in a recurrence. Concatenated 
sequences are naturally robust against these errors, as 
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can be seen clearly in the figure, where CDD results in 
a saturation of the purity. The asymptotic purity level 
is roughly equal to the square of the per-pulse-error 5, J 
(recall subsection IIII Cp . For example, with 5 — Ips and 
J = IMHz we find ((5J)^ — 10^^^, in agreement with 
Fig. [21 While it is impossible to go to error rates be- 
low per-pulse-error, with concatenation we are able to 
maintain the error rate at this minimum. 



HIGHER ORDER (TROTTER-SUZUKI) 
UNIVERSAL DECOUPLING 



Suppose a universal dynamical decoupling sequence is 
known, i.e., essentially a series of unitarily transformed 
Hamiltonians Hj = PjHeP] such that J^jPjHeP] = 
B ^ Is for some environment operator B. As we saw 
previously, the sequential application of propagators gen- 
erated by these Hamiltonians acts trivially on the system 
only up to the first order in ||i3a||ro where tq is a typ- 
ical free-evolution period duration. Trotter-Suzuki ex- 
pansion allows us to construct a sequence of these prop- 
agators that act trivially on the system up to any order 
n. Suppose {eAj}'^^^ are dimensionless Hermitian oper- 
ators such that ^ Aj — A and e is a small parameter. 
Trotter-Suzuki expansion allows us to find a sequence of 
indices {ni}fLi (n^ G {1 . . . k}) and real numbers {ci}fLi 
such that 



(60) 



The number N of propagators required, scales with e" . 
It is worth emphasizing that while numerous, the se- 
quence of coefficients Ci and indices Ui can be constructed 
recursively [l^[38l|. 

For practical purposes it is advantageous to use a vari- 
ation on this expansion in which the coefficients Ci are 
rational numbers. Once the sequence of propagators is 
known we can construct the Trotter-Suzuki decoupling 
sequence (TSDS) accordingly: Set up Aj = iHj and 
A = iB (g) I based on the universal DD cycle and em- 
ploy the following DD sequence using the smallest pulse 
switching time Tmin available: 

(61) 

where f [r] denotes a free evolution period of duration 
T. By dimensional analysis, the error in Eq. (|60p . e", 
translates into left-over terms asymptotically bounded 
by products of operators Ba and is thus bounded by 
0[(Ti„in max ||_Bc(||)"]. We can thus asymptotically esti- 
mate the error phase as: 

^-TSDS = O [(r„iinmax{||Ba||})"] 



= O [(rmax{||S„||}/iV)^/^J , (62) 

A serious problem is that negative Ci 's routinely appear 
in the TSDS (except for n = 3). Clearly, this presents a 



DD Scheme 


$ for J < /? 


$ for /3 < J 


Periodic DD 


T'^PJ/N 


T'^J'^/N 


Concatenated DD 


(/3T/A/-i''2)i°g4iV(jT) 




Trotter-Suzuki 


(/3r/iV)Vi°S4JV 


{JT/N)V^°S4N 



TABLE I: Asymptotic comparison of the error phase $ for 
deterministic decoupling schemes. T is the total experiment 
duration, A'' the total number of pulses, J and /3 are defined 
in Eq. (flHll. 



major problem since one cannot have negative times in 
the free evolution segments in Eq. ((6T|) . For this reason, 
as presented the TSDS is not physically implementable. 
A solution for this problem is not yet available to us, but 
does not seem impossible. 

Table jH summarizes the asymptotic performance of 
FDD, CDD and TSDS in the hmit of large number of 
pulses and the two regimes of J < /? and /3 <C J. 

The TSDS performs remarkably well and is oblivious 
to the distinction between pure-bath and system-bath 
dynamics. Nonetheless, its performance is sensitive to 
small errors in pulse operation and the switching times 
that need to be precisely set (not required in FDD and 
CDD and randomized decoupling schemes) . Thus it may 
not be a robust alternative to CDD, in spite of its supe- 
rior convergence properties, but it may be used as a basic 
dynamical decoupling sequence at the base of a new con- 
catenated pulse sequence. This is particularly useful with 
the TSDS at n = 3. 

Finally, we note that the Trotter-Suzuki expansion was 
also recently used by Brown et al. in a study of arbitrarily 
accurate composite pulse sequences [s^. There the goal 
was to overcome systematic errors in the system control 
Hamiltonian, without considering decoherence. 



VI. DECOUPLING WITH VERY NARROW 
PULSES CANNOT INCREASE ERROR NORMS 

In this section we return to DD with ideal, zero width 
pulses, and argue that such DD sequences can never effec- 
tively strengthen the undesired terms and cannot cause 
extra errors. We then argue that even with finite-width, 
but sufficiently narrow pulses, error norms cannot in- 
crease under DD. These results are of independent in- 
terest and apply to any pulse-based error suppression 
strategy, including closed-loop quantum error correction. 

Consider a sequence of ideal unitary operations Pi ap- 
plied to a quantum system (measurements can also be 
included by enlarging the Hilbert space), and suppose a 
sequence of intervals Ti separates these operations, such 
that the pulse sequence is given by: Pif ti^2- ■ • f T„_i^n- 
In the absence of environmental couplings, the overall 
effect of the pulse sequence would be given by Qn = 
Pi ... P„ - the "ideal operation" . But in the presence of 
the environmental couplings in He, the overall propaga- 
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tor U is modified by error terms. Letting Qi = Pi . . .Pi, 
we can define an effective unitary error operator corre- 
sponding to the whole sequence in the following manner: 

U = Pie-"'"^P2e-"-'''P3 . . . Pn-ie'"""^Pn 

= g-iriQl-HeQlg-ir2Q2HeQ| g-iTiQ„_i_f/eQl_iQ^ 

= :expi^iTH'jPi...Pn (63) 

Note that in the last line we have defined the over- 
all duration T = '^Ti, and isolated the effective error 
Hamiltonian H'^ from the desired, ideal unitary opera- 
tion Qn = Pi . . . Pn- 

As before, we use a unitary operator norm ||.|| to com- 
pare the strength of the error Hamiltonians He and H'^. 
Unitary operator norms [IHl, such as the absolute dif- 
ference between the largest and smallest eigenvalues, are 
invariant under unitary transformations (for unitary U , 
\\UAU^\ = \\A\\) and can be used as measures of fidelity 
errors in Hamiltonian error correction theory (3l| . We 
now use the following existential theorem due to Thomp- 
son [iO, llj: Let j4i, A2 be Hermitian matrices. Then 
there exist unitaries Ui and U2 and a Hermitian matrix 
A such that 

g«Aig»A2 ^ g^A. ^ ^ UiAiUl + U2A2UI 

This theorem can be extended (induction on the num- 
ber of exponentials) to products involving more than 
two matrix exponentials: e'"^"^ • ■ • e*"*" = e*'^. Using 
Eq. ([55]) and Thompson's theorem we have: TH'^ — 
'^iTiUiQiHeQ\u} , and we have the following inequal- 
ity for the norm of H'^: 

i 

<^Y.^^\\U^Q^HeQ\ul\\ = \\He\\ (64) 

i 

Thus the norm of the effective error Hamiltonian does 
not increase under the action of ideal unitary operators. 

In the case of non-ideal pulses carrying systematic er- 
rors per pulse (e.g., due to finite pulse- width), we can 
model the pulse error as a unitary error operator imme- 
diately preceding the ideal pulse. For single-qubit pulse 
errors due to finite pulse widths, one can again show that 
[L^ell — ll^ell provided the pulse widths are small enough 
[42 | . This allows us to use Thompson's theorem to show 
that our argument in this section also applies to pulses 
of sufhciently narrow width on a single qubit. We ex- 
pect this argument to apply to multi-qubit near-perfect 
operators in the presence of a bounded bath. 

As a special case, this argument applies to dynam- 
ical decoupling. While positively reassuring that with 
ideal pulses the undesired couplings do not increase in 
strength, the present argument does not quantify the ef- 
ficiency of dynamical decoupling. For this purpose we 



employed, above, approximations based on the Magnus 
expansion. 

The same argument applies in a quantum error cor- 
rection codes setting 14311 . in p articular in non-Markovian 
fault-tolerance theorv |3]]. |44|. 14^ . where a "time-resolved 
fault path" expansion has recently been used to decom- 
pose the action of errors in the course of a general evo- 
lution. Our argument can be used to further rational- 
ize such expansions based on the fact that errors are 
well-behaved (in the sense of \\H'J\\ < ||i?e||) in the non- 
Markovian regime. 



VII. SUMMARY AND DISCUSSION 

Dynamical decoupling (DD) cannot be exactly ana- 
lyzed without concrete reference to the details of the 
system-environment coupling, but an abstract picture of 
the interaction in terms of bounded environment opera- 
tors - as pursued here - can yield useful performance 
estimates. Within this framework, we have provided 
an analytic estimate of the leading order decoupling er- 
ror associated with the basic universal decoupling cycle 
for a qubit. We have analyzed and compared the per- 
formance of periodic DD (FDD) and concatenated DD 
(CDD) schemes. We have provided detailed calculations 
supporting the conclusion reported in [T?] , that CDD sig- 
nificantly outperforms FDD within practical boundaries 
of pulse parameter space. We have distinguished between 
two different limiting cases of fast versus slow environ- 
ment dynamics. Fast dynamics of the environment limits 
the performance of higher order deterministic dynamical 
decoupling. This can be understood from the interac- 
tion picture, where the system-bath interaction Hamil- 
tonian is fast fluctuating. Slow bath dynamics, on the 
other hand, can be exploited by CDD to result in super- 
exponential decoupling using an exponential number of 
pulses. Table U provides a convenient summary of the rel- 
ative performance of FDD and CDD, as well as the new 
Trotter-Suzuki based pulse sequence we have introduced. 

Our discussion was based on a pulsed control mode, 
but it is known that higher fidelities are possible via fine- 
tuned navigation of the control Hamiltonian [23|. This 
direction can be especially useful when decoupling meth- 
ods are to be used not for quantum state preservation 
but for performing an error-corrected quantum evolu- 
tion. Dynamically error corrected evolution can also be 
achieved via a hybrid decoupling error correction method 
[H, ESi 13 • Consider a stabilizer quantum error correct- 
ing code characterized by a stabilizer group of Fauli op- 
erators [49I [50| . Assume that this code corrects all single 
qubit errors, which means that each single qubit error an- 
ticommutes with at least one stabilizer generator. This 
anti-commutation condition translates into time reversal 
of the error, when exponentiated. Thus it is possible to 
use the generator set of the stabilizer to form a univer- 
sal decoupling sequence for decoupling single qubit error 
terms on the code space. In this way decoupling inte- 



13 



grates seamlessly with the encoded quantum operations 
on the code space generated by Hamiltonians written as 
the sum of normalizer elements of the code. This also al- 
lows us to perform quantum error detection and recovery 
within a hybrid decoupling-error correction setting, thus 
allowing the correction of errors in both the Markovian 
and the non-Markovian regimes |42] . 

The open-loop approach of DD appears at first sight 
to be conceptually and practically very different from 
the method of closed-loop quantum error correcting 
codes [5l|. However, recent results on the theory of 
non-Markovian fault tolerant quantum error correction 
(FTQEC) [m Si, [H suggest that the error bounds ap- 
plicable in the fault tolerant, concatenated version of DD 
are in fact very similar to the bounds relevant to FTQEC. 
Specifically, in both CDD and FTQEC it is essential 
that the system-bath interaction Hamiltonian is norm- 
bounded (see Refs. (s^, [S^ for a critique of this assump- 
tion) . In light of the much smaller degree of overhead in- 
volved in CDD, this suggests that in the non-Markovian 
regime one can profit significantly by incorporating CDD 
into a closed-loop QECC procedure. CDD can then re- 
move the leading order bath-induced errors, while the 
QECC procedure can target primarily the random con- 
trol errors for which CDD offers only limited protection. 

We note that it is possible to view the CDD proce- 
dure as a discrete time dynamical system, whose ideal 
fixed point is a vanishing system-bath interaction. In 
this manner it should be possible to characterize the re- 
gion of correctable errors using tools from the analysis 
of fixed points, and to incorporate perturbations of the 
pulse sequence. An analysis of CDD from this perspec- 
tive may well be a fruitful endeavor. Indeed, there exists 
a dynamical maps approach to concatenated quantum 
error correction, which has proven to be very convenient 
in the analysis of the fault tolerance threshold [13, [HI ■ 



As a final comment we should emphasize that the abil- 
ity to perform arbitrarily precise Hamiltonian control on 
the system Hamiltonian assumes a classical control mech- 
anism, while every control system is really quantum in 
nature. Thus besides technological constraints, funda- 
mental quantum fluctuations may limit the performance 
of DD as well, since perfect classical control simply does 
not exist. A systematic characterization of bounds on the 
fidelity of feedback-free error correction schemes such as 
DD, imposed by fundamental quantum fluctuations, is 
still an important open question. 
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APPENDIX A: VALIDITY OF THE MAGNUS 
EXPANSION 

We analyze the convergence domain of approximations 
and the dynamical renormalization process, specifically 
as it applies to CDD. This recursive renormalization can 
be written as: 

4 

f/("+i) := exp(-*r„+ii/("+i)) = [] cxp(-ZT„i/j")) 

(Al) 

where t„+i = 4t„, i.e., 

t-„=4"to, (A2) 

and where 





= 4"^ ^ 


E>I + 








5 Y 


+ 4"^ ^ 
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E>I + 
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- bP a 




= XHj^'^X, 


rrin) 
^3 








»X 


+ s(") $ 


5 Y 


- bP a 




= YH^'^^Y, 




= 4"^ ^ 






?)X 


- sir) ^ 


5 Y 


+ si"^ $ 




= ZHi"'>Z, 



(A3) 



are the recursive generalization of Eqs. ([8]). The Magnus 
expansion of C/*^"-' yields: 

oo 

f/(") = exp{-iTnH^"'> ) = exp(^ ^ ) , (A4) 



whence 



,i/(") = + 4")] + T„C("\ (A5) 



where 



(n-1) 



l=i<i=4 



(A6) 
(A7) 



i=3 



1=3 



i commutators 



(A8) 
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where is a numerical factor determined by explicit 
computation of the ith order Magnus expansion, and 

C(n) 

is an operator-valued correction to the second or- 
der Magnus expansion. 

(n) 

We would like to find an approximation for the Ba ■ 
To do so we will first show that it is consistent to use the 

(n) 

second order Magnus expansion for He , in the sense 
that 

if (") = ii (") « A (^(") + ) = : ^-"^ ® ' 

^" a=G,X,Y,Z 

m 

where we can safely neglect C'-"' [i.e., all ^j-"2] '^■^ ^""^ff 
as 

T„/3 « 1. (AlO) 

This is the recursive generalization of the result obtained 
above for n = 1. Then the i?!"-* will be the desired ap- 
proximation to The proof is by induction. We will 
require the following inequalities, satisfied for bounded 
operators A and B [2a| : 

||[Ai?]|| < 2||A||||i?||, (All) 
\\m < \\A\\\\B\\, (A12) 
\\A + B\\ < ||A|| + ||B||. (A13) 



Lemma 1 The following relations hold: 



||C(")||«-||4")|| (A14) 



and 

\\Bi"^\\ = 0{(3), a = X,Y,Z (A15) 
B^"' - So. (A16) 

Proof We prove the lemma by induction. Let us call 
Eq. (lUill "a(n)", Eq. ((1151) "ba(n)", and Eq. (IUHl) 
"bo(n)". We have already established the case a(l) 
[Eq. p9)) ]. and bQ,(l) and bo(l) are based on our defini- 
tions and assumptions. We will show that (1) ha{n — 1) 
& bo(n- 1) ^ a(n), (2) bo(ri- 1) ^ bo(n), and then (3) 
a(n) & bQ,(n — 1) \ia(n). Recall all along that we have 
assumed J < [3. 



(1) Proof of a(n): We have, using Eqs. (|T7l) . ((X8|) and 
inequality (jAlip 



r„||C(")||<^m,r:_i||[i7j"-^\[ij("-i\...]]|| = E 1^"'^ 1 1 H^l""'^ 1 1 ' ' ' )l 1 1 



i=3 



ba(n-l) & bo(n-l) 



i commutators 
/ oo 



i—2 terms 



O E<-i/3'"'P2"^ll " = 0(r„_i/3||4"^||) « ||4"^| 



(2) Proof of bo (n): 



,(«) 



-«T„_1 



(n-l) 



-^4r„_lB;,"^^^ 0/ 



(3) Proof of ba(n): From Eq. (|A9|) we have 



=o.x,y,z 



a=O.X,Y.Z 



Since ^2"'' contains no pure-environment terms it determines the part contributing to the sum over a ^ X,Y, Z: 



l=i<J=4 



Ineq. iXT]l -l lAT3l 



,(ri-l)|| II rr(n-l)|| bc(n-l) 



1=^=4 

Since the system operators all have [[S'q, || = 0(1) and the environment operators all have similar norm, we can 

conclude that, as required, = 

The upshot of this proof is the following: 
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Corollary 2 The recursive second-order Magnus expansion, Eq. liA9\) . is a valid approximation provided we assume 

TnP < 1. 

The condition t„/3 ^ 1 of course puts a physical upper limit on the number of levels of concatenation. Provided 
this condition is satisfied, it follows that, schematically, we have 



T2 



'n. 



(n) 



Sa- 



(A17) 



r 



Note that in the body of the paper, for notational sim- 
plicity we dropped the tilde, with the convention being 
that we are only considering the environment operators 
defined by the second-order Magnus expansion. 



APPENDIX B: FINITE PULSE WIDTH 
ANALYSIS FOR CDD 

For brevity define 

II II, Pa.:=P^°\ /?:=/3o, (Bl) 

and use Ineqs. (|Alip - (jA13|) to reproduce the recursive 
inequalities corresponding to /3i"'' from Eqs. ([29l) - (f32|) : 



/3^"^ = 13, 



(B2) 

-^{bx + -l3Y) (B3) 

Tn-l ^ TT 



1 5 



+ 



(B4) 



4") < 6 



X 



Tn-1 



-132 



A necessary condition for convergence is (3a ^ < = (3a 
for a = X,Y, Z. Let us define constants a, b such that 

(3y = a(3x, (3z = h(3x. (B6) 
Then we have for the n = 1 case of Eq. (|B3p 

<2(ro-<5)/3/3^ + -(i + -)/3x, (B7) 

To Z TT 

which must be smaller than (3x ■ We thus find the neces- 
sary condition 

2{t^-5)(3+-{\ + -)<!. (B8) 

To 2 TT 



Next consider the n = 1 case of Eq. (|B5|) , and set it to 
be smaller than Pz- 



pflPz < iS(3x) 



-(1 



< 1. 



To 



(B9) 



Both quantities SPx , S /tq are ^ 1 by our previous as- 
sumptions, so this inequality is automatically satisfied. 
Finally, consider the n = 1 case of Eq. (IB4p . and set it 
to be smaller than (3y'- 



P^y^/(3y < iTo~S)(3+iTo-2S)-Px + --- < 1. (BIO) 

a IT To a 

We can simplify these results somewhat, as follows. We 
can replace Px hy J — max(||_Bx|| , ||-By|| , \\Bz\\) < P 
[Eq. (|18p ]. and assume for simplicity a = b = 1. We can 
then replace Ineqs. ((B8l) . (|B10|l by 

2{to-S)P+(1 + -)- < 1 (BU) 

2 TT To 

{To-d){p + J)-5J+-— < 1. (B12) 

TT To 

We can safely replace J by /3 (since J < P) and drop 5 J . 

1 s 



1 5 b 



(B5) This turns the second inequality into 2 (ro—^)/3-f;^^ < 1 



so it is subsumed by the first inequality. 

APPENDIX C: CONCATENATION OF THE 
CPMG PULSE SEQUENCE 

Consider an error Hamiltonian He of the form 
He ^ B^(g> Z + Bo® Is. 

A simpl e dy namical decoupling sequence (also known as 
CPMG (ssLIsgII) can decouple this error Hamiltonian: 

pi = XfXf 
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where f denotes a free evolution interval of duration t. 
The propagator corresponding to this sequence is given 
by: 

jj _ ^-iT{-B^(»Z+Bo(g>Is)^-iT{B^(g)Z+Bo»Is) 

A simple application of the Baker-Campbell-Hausdorf 
(BCH) [HBl expansion shows that U can be written as 

JJ ^ ^-i2T{Bo(g,Is+F[Bo,Bz](g>Z) 

where F[Bo^ Bz] is a Hermitian operator in the Lie sub- 
algebra generated by Bq and Bz- One can show that 



\\F[Bo,Bz]\\ = 0{\\Bo\\\\Bz\\t) in the limit of r ^ 0. 
We can thus use the same sequence for decoupling the 
undecoupled term F[Bo, Bz]® Z and construct 2nd and 
3rd order CDD sequences: 

P2 = f Xf f Xf (CI) 
P3 = Xf Xf f Xf Xf Xf f Xf (C2) 

Notice that concatenated CPMG requires far fewer pulses 
than concatenated universal DD. However, CPMG is not 
as robust with respect to systematic errors in the pulses 
as concatenated universal DD. 
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